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Title : Sequencing of Peptides by Mass Spectrometry 

FTFTP OF THE INVENTION 

This invention relates to methods of protein analysis and more 
particularly concerns methods of sequencing oligopeptides. 
5 BACKGROUND OF THE INVENTION 

Identifying the sequence of amino acid residues in a protein is often 
the first step in a study of the function of an unknown protein. This is 
traditionally accomplished by means of Edman degradation (Edman (1970)) 
Despite being a time-honored method, Edman sequencing has a major 

10 disadvantage in that it requires, typically, milligram quantities of a protein, 
an amount often considered excessive in today's laboratory practice and 
standards. Gas-phase microsequencing by means of tandem mass 
spectrometry has increasingly been applied to obtain partial, or even 
complete, sequences of unknown peptides (Hunt et al. (1986); Johnson et al. 

15 (1988); Papayannopoulos (1995)). In the last couple of years, a major 
application of the above has been in the area of proteome analysis, where 
partial sequences of enzymatically cleaved peptides have been used for the 
purpose of protein identification via the so-called "sequence tag" method 
(Shevchenko et al. (1996); Figeys et al. (1996)). Mass spectrometry's major 

20 advantages in chemical analysis have been well-documented (McLafferty 
(1981)) and are applicable to peptide sequencing. 

A protonated peptide, produced by means of electrospray (Fenn et 
al. (1989), fragments typically under low-energy (< 100 eV) tandem mass 
spectrometric conditions to yield product ions that are indicative of the 

25 amino acid residue sequence (Hunt et al. (1986); Johnson et al. (1988); 
Papayannopoulos (1995)). In solution, the most basic sites on a peptide are 
the nitrogen atoms at the N-terminal amino group and the side chains of the 
histidine, lysine, and arginine residues (Cantor et al. (1980); Smith et al. 
(1991)). However, once the protonated peptide ion has been desorbed into 

30 the gas phase, transfer of the "external" proton, particularly after collisional 
activation, to amidic functional groups, the carbonyl oxygen and the amidic 
nitrogen atoms, on the backbone becomes possible (McCormack et al. 
(1993); Jones et al. (1994); Dongre et al. (Mass Spectrom. 1996); Dongre et al. 
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(/. Am. Chem. Soc. 1996). The peptide then fragments at the protonated 
peptide linkage to yield an ionic and a neutral fragment. If the charge is 
retained on the N-terminal fragment, a b ion is produced; the b ion may 
subsequently lose CO to form the a ion. Alternatively, if the charge is 
5 retained on the C-terminal fragment, a y" (y + 2H) ion is produced (Hunt et 
al. (1986); Johnson et al. (1988); Papayannopoulos (1995); McCormack et al. 
(1993); Jones et al. (1994); Dongre et al. (Mass Spectrom. 1996); Dongre et al. 
(/. Am. Chem. Soc. 1996). Results from a series of studies have shown that 
the b ion has an oxazolone structure (Yalcin et al. (1995); Yalcon et al. (1996); 
10 Ambihapathy et al. (1997); Nold et al. (1997)), the a ion is an immonium ion, 
and the y" ion is a protonated peptide or amino acid (Hunt et al. (1986); 
Johnson et al. (1988); Papayannopoulos (1995); McCormack et al. (1993); 
Jones et al. (1994); Dongre et al. (Mass Spectrom. 1996); Dongre et al. (/. Am. 
Chem. Soc. 1996). A product-ion spectrum of a protonated peptide typically 
15 consists of series of b, a, and y" ions which reflect the amino acid sequence. 
Unfortunately, the fragmentation chemistry of a peptide is rich, and there 
may be gaps in the series of product ions; this makes sequencing nonroutine 
to all but the highly experienced. 

The fragmentation of alkali- and alkaline-metal containing peptides in 
20 tandem mass spectrometry has been a subject of much interest and study 
(Renner et al. (1988); Grese et al. (1989); Grese et al. (1990); Hu et al. (1992); 
Hu et al. (J. Am Soc. Mass Spectrom. 1993); Hu et al. (/. Am. Chem. Soc. 1993); 
Tang et al. (1988); Teesch et al. (1990); Teesch et al. (/. Am. Chem. Soc. 113, 
812-820 (1991)); Teesch et al. (J. Am. Chem. Soc, 113, 3668-3675 (1991); Zhao 
25 et al. (1993); Leary et al. (1989); Leary et al. (1990); Lee et al. (1998)). 
Although most of these reports concentrated on examination of the 
structures of metal-containing product ions and the mechanisms of their 
formation (Renner et al. (1988); Grese et al. (1989); Grese et al. (1990); Hu et 
al. (1992); Hu et al. (J. Am Soc. Mass Spectrom. 1993); Hu et al. (J. Am. Chem. 
30 Soc. 1993); Tang et al. (1988); Teesch et al. (1990); Teesch et al. (J. Am. Chem. 
Soc. 113, 812-820 (1991)); Teesch et al. (/. Am. Chem. Soc, 113, 3668-3675 
(1991); Zhao et al. (1993); Leary et al. (1989); Leary et al. (1990); Lee et al. 
(1998)), the potential for acquiring sequence information from 
metal-containing, e.g., lithiated and sodiated, peptides was readily apparent 
35 irrespective of whether that was implied or explicitly stated (Leary et al. 
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(1989) ). Unlike protonated peptides, lithiated and sodiated peptides tend to 
fragment after collisional activation to yield primarily the [b„ + OH + X]+ 
ions (X = Li or Na) (Renner et al. (1988); Grese et al. (1989); Grese et al. 

(1990) ; Tang et al. (1988); Teesch et al. (/. Am. Chem. Soc, 113, 3668-3675 
5 (1991); Lee et al. (1998)). These product ions are believed to form from 

elimination of C-terminal residues from precursor ion structures in which 
the alkali-metal ion is bound to the C-terminal carboxylate anion of a 
zwitterionic peptide. Rearrangement as a result of collisional activation 
results in transfer of the C-terminal OX group to the carbonyl carbon of the 

10 preceding residue and elimination of CO and an imine (Renner et al. (1988); 
Grese et al. (1989); Grese et al. (1990); Lee et ala. (1998)). Unfortunately, the 
affinities of peptides for the lithium and the sodium ions are weak relative to 
those for the proton (Bouchonnet et al. (1992); Klassen et al. (1996)); 
consequently, cleavage of the metal ion is often a more favorable 

15 fragmentation pathway relative to backbone fragmentation and leads to 
low yields in these more sequence-informative reactions. 
STTIVTMARY OF THF TNTVFNTION 

The present inventors have found that they are able to determine the 
sequences of peptides or proteins by analysing the peptides or proteins in 

20 argentinated form using mass spectrometry. 

Accordingly the present invention provides a method of analyzing 
argentinated peptides or proteins using mass spectrometry comprising: 

(a) combining an oligopeptide with silver to provide a sample 
comprising argentiated oligopeptide; 

25 (b) submitting the sample to a mass spectrometer; 

(c) performing scans of silver containing peaks in optimum 

collision energies; 

(d) identifying any doublet or triplet peak pattern; 

(e) confirming with Y ions; 

30 (f) determining partial sequence by the mass separation between 

two successive doublet or triplet patterns. Preferably the oligopeptide 
comprises from about 3 to about 10 amino acids and the performing scans 
comprise collecting product ion spectra of the [M + Ag] + ion, where M = 
oligopeptide; 

35 According to a preferred embodiment, the method of the invention 



utilizes silver nitrate. 

According to another embodiment of the method of the invention 
according the determination of partial sequence comprises searching for, 
and identifying cleaved amino acid residues based on differences in m/z 
5 values of neighboring triplets where the m/z value of the [b n - H + Ag] + ion 
and the corresponding [y„ + H + Ag] + ion are related by the formula: [y n + 
H + Ag] + = [M + Ag] + + Ag+ - [b„ - H + Ag]+. Preferably the searching and 
identifying is conducted by a custom search algorithm, more preferably the 
algorithm is written in Visual Basic and looks for the triplet peak pattern of 
10 (m/z) lf (m/z) 2 ) = {m/z) x - 18.0, and (m/z) 3 = {m/z\ - 28.0 as well as the doublet 

pattern of (m/z) 2 and (m/z)3, all to within ± 0.5 m/z unit. 

According to yet another embodiment of the method the product ion 
spectra of the [M + Ag] + ion are collected under f^s, of 1.5, 2.0, 2.5 and 3.0 
eV. 

15 According to a further embodiment of the invention the method the 

mass spectrometer of the method is a triple quadrupole mass spectrometer, 
two triple quadrupole mass spectrometers, a quadrupole /time-of- flight 
mass spectrometer, an ion-trap mass spectrometer, or a time-of-flight mass 
spectrometer amenable to post-source decay or collision-induced 

20 dissociation. 

According to another embodiment the method according to the 
present invention comprises: 

(a) combining an oligopeptide with silver nitrate in solution; 

(b) submitting a sample of the solution to a mass spectrometer; 
25 (c) collecting product ion spectra of the [M + Ag] + ion, where M = 

oligopeptide; 

(d) identifying the triplet peak pattern; 

(e) identifying the doublet peak pattern; 

(f) searching for, and identifying cleaved amino acid residues 
30 based on differences in m/z values of neighboring triplets where the m/z 

value of the [b n - H + Ag] + ion and the corresponding [y n + H + Ag] + ion are 
related by the formula: [y n + H + A^] + = [M + Ag]+ + Ag+ - [b n - H + Ag]+. 
Preferably the oligopeptide comprises from about 3 to about 10 amino 
acids. 
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According to a preferred embodiment according to this method the 
searching and identifying is conducted by a custom search algorithm, 
preferably the algorithm is written in Visual Basic and looks for the triplet 
peak pattern of {m/z) v (m/z) 2 ) = {m/z\ - 18.0, and (m/z) 3 = (m/z) 2 - 28.0 as 

5 well as the doublet pattern of (m/z) 2 and (m/z)3, all to within ± 0.5 m/z unit. 
More preferably the product ion spectra of the [M + Ag] + ion are collected 
under £ cm s, of 1.5, 2.0, 2.5 and 3.0 eV. Preferably the mass spectrometer 
used according to the method of the invention is a triple quadrupole mass 
spectrometer or two triple quadrupole mass spectrometers, however, any 

10 mass spectrometer capable of tandem mass spectrometry, such as a 
quadrupole /time-of-f light mass spectrometer, an ion-trap mass 
spectrometer, or a time-of-flight mass spectrometer amenable to post- 
source decay or collision-induced dissociation, may be used. 

Other features and advantages of the present invention will become 

15 apparent from the following detailed description. It should be understood, 
however, that the detailed description and the specific examples while 
indicating preferred embodiments of the invention are given by way of 
illustration only, since various changes and modifications within the scope of 
the invention will become apparent to those skilled in the art from this 

20 detailed description. 

DESCRIPTION OF THF. DRAWINGS 

The invention will now be described in relation to the drawings in 

which: 

Figurel shows product ion spectra of the [M + 107 Ag] + ion of leucine 
^ a J)\ 25 enkephalin, ^GGFL, at different E cm s: (a) 1.5 eV, (b) 2.0 eV, (c) 2.5 eV, and 
$V \ (d) composite \ (a) to (c). b* = [b„ + OH + Ag]+, b = [b„ - H + Ag]\ and a = 
[a M - H + Ag]+. 

Figure 2 snows a product-ion spectrum of the [M + 107 Ag] + ion of 
leucine enkephalin\YGGFL, at a linear E cm function from 2.5 eV for m/z = 30 

30 to 1.5 eV for m/z = 6t 

Figure 3 show\ a product-ion spectrum of the [M + 107 Ag] + ion of 
GGEGG at an E cm of 2.0yeV. 

Figure 4 shows the composite product-ion spectrum of the [M + 
i09 Ag ]+ ion of bradykinin,VpPGFSPFR, at E cm s of 2.0 and 2.5 eV. 
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1\ Figure 5\hows a product-ion spectrum of the [M + ^7 Ag ] + ion of 

^ynorphin A fragment 1-7, YGGFLRR, at E cm = 1.9 eV. 

Figure 6 shows the product-ion spectra of tryptic peptides at linear 
E cm functions (a) from 4.6 to 1.8 eV and (b) from 4.0 to 1.6 eV. 
5 Figure 7 shows a custom search algorithm of the invention written in 

Visual Basic. 

DETAILED DESCRIPTION OF THF. INVENTION 

As mentioned above, the present inventors have found that they are 
able to determine the sequences of peptides or proteins by analysing the 
10 peptides or proteins in argentinated form using mass spectrometry. In its 
broad aspect the invention provides a method of analyzing argentinated 
peptides or proteins using mass spectrometry comprising: 

(a) combining an oligopeptide with silver to provide a sample 
comprising argentiated oligopeptide; 
15 (b) submitting the sample to a mass spectrometer; 

(c) performing; scans of silver containing peaks in optimum 

collision energies; 

(d) identifying any doublet or triplet peak pattern; 

(e) confirming with Y ions; 

20 (f) determining partial sequence by the mass separation between 

two successive doublet or triplet patterns. 
Mass Spectrometry 

Although the data presented in the examples described herein were 
conducted on two triple quadrupole mass spectrometers on two triple 

25 quadrupole mass spectrometers, any mass spectrometers that are capable of 
tandem mass spectrometry, such as a quadrupole /time-of-f light mass 
spectrometer, an ion-trap mass spectrometer, or a time-of-flight mass 
spectrometer amenable to post-source decay or collision-induced 
dissociation, may also be used, with appropriate adjustments known to 

30 those skilled in the art. The ions may be generated by a number of methods 
including electrospray, MALDI (matrix-assisted laser desorption ionization) 
and FAB (fast-atom bombardment). The method of the invention can be 
applied to oligopeptides of any length, preferably the number of amino acid 
residues is between 3 and 20. 

35 Where prepared oligopeptides of unknown sequence are to be 
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analysed they are put into solution in 50/50 water/methanol with a 5:1 ratio 
of moles of silver : moles of peptide. The exact composition of the solution 
and the mole ratios are not crucial. The sequencing will work for solutions 
of pure methanol and pure water, and mole ratios of 1:1 to 100:1. Proteins 
5 of unknown sequence are preferably digested by methods known to those 
skilled in the art. For example, trypsin or other active protease is reacted 
with an optimum amount of protein substrate and digested to completion. 
As for oligopeptides sufficient silver (e.g., silver nitrate) is added to provide 
adequate binding for analysis. 
10 Sequencing 

The silver (I) ion is the preferred ion to be used as it exhibits relatively 
higher affinities than alkali-metal ions for nitrogen- and sulfur-containing 
small ligands that are representative of the functional groups that exist in 
amino acids and peptides (Narula et al. (1991); Stillman et al. (1994); Deng et 

15 al. (1998); Li et ala. (1997); Lee et al. (J. Am. Soc. Mass Spectrom. 1998); Lee et 
al. (/• Am. Chem. Soc. 1998)). While not preferred, alkali-metal ions in the 
method of the invention are included within the scope of the invention. 
Also within the scope of the invention are chemical equivalents of the silver 
ion, i.e., chemical compounds which are able to provide comparable results 

20 with those achieved with silver. As used herein, "silver" includes any source 
of silver, or source which is able to provide a silver-ion or chemical 
equivalent. 

Electrospraying a peptide solution containing silver (I) typically yields 
an intense [M + Ag] + ion, which fragments efficiently under low-energy 

25 collision conditions to yield intense product ions that result from backbone 
fragmentation (Li et al. (1997); Lee et al. (/. Am. Chem. Soc. 1998)). These are 
the following: [b„ - H + Ag] + , [a„ - H + Ag] + , [y„ + H + Ag]+ , and [b„ + OH + 
Ag] + ions. The [b„ - H + Ag]+ ion has been determined to be an 
N-argentinated oxazolone, whereas the [a„ - H + Ag]+ an an N-argentinated 

30 immonium ion (Lee et al. (J. Am. Chem. Soc. 1998)). A fraction of the 
fragmenting [b„ + OH + Ag] + has been postulated to have a silver salt 
structure in which the Ag + ion is bound to the carboxylate anion of the 
zwitterionic peptide, although the most stable [b„ + OH + Ag] + structure is 
nonzwitterionic; the [y„ + H + Ag] + s an argentinated amino acid or 
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truncated peptide (Chu et al. (not published)). For identical peptides, the 
product ions of [M + Ag] + are more intense than those of [M + Na]+ , and 
the relative intensities of the [b„ + OH + Ag] + ions are typically lower than 
those of the [b„ + OH + Na] + ions (Chu et al. (not published)). These 
5 observations are in line with the softer properties of Ag+ relative to Na + , 
and may be interpreted to result in relatively less prominent binding to the 
harder carboxylate terminus and more prominent binding to the softer 
N-terminal and amidic nitrogen atoms (Lee et al. (/. Am. Chem. Soc. 1998); 
Chu et al. (not published)). Accordingly, with an appropriate choice of 

10 collision energies, the [b„ + OH + Ag] + , [b n - H + Ag] + , and [a„ - H + Ag]+ 
ions of the same n can often be made to have abundances within a factor of 
5 from one another (together with typically less abundant [y M + H + Ag]+ 
ions). From a sequencing point of view, this is convenient as the triplet 
peaks of [b„ + OH + Ag]+ ,[b„ - H + Ag]+ , and [a w - H + Ag]+ ions are 

15 separated by a constant 18 and 28 m/z units. The triplets are easily 
recognizable in a search algorithm that identifies peaks of 18 and 28 m/z unit 
separation; the difference in m/z values of neighboring triplets identifies the 
amino acid residue that is cleaved. However, as will be readily understood 
by those skilled in the art, the invention is not so limited and includes within 

20 its scope recognition of any doublet or triplet pattern and searching on this 
basis. 

Accordingly, in order to obtain sequence information, product-ion 
spectra of the [M + Ag] + ion collected under several , (for examples 1.5, 
2.0, 2.5, and 3.0 eV, although other values may be used, values are chosen to 

25 provide as wide a range of abundant product ions as possible) and are 
summed to yield a composite spectrum. Such spectra exhibit a wide range 
of abundant product ions. Alternatively, product-ion spectra are acquired 
with an m/z dependent £ cm function (typically linear) to maximize the 
number of observable sequence-relevant product ions. The peak abundance 

30 file of the spectrum for sequencing, after elimination of peaks below a 
certain user-defined threshold, typically 10%, are read into a custom search 
algorithm written in Visual Basic (see Figure 7) which looks for the triplet 
peak pattern of {m/z\, (m/z) 2 ) = {m/z) x - 18.0, and (m/z) 3 = (m/z) 2 - 28.0 as 
well as the doublet pattern of (m/z) 2 and (m/z) 3 , all to within ± 0.5 m/z unit. 



Triplet and doublet peak patterns that are found are imported into any 
commercially available spreadsheet program, for example, Excel. A 
user-defined threshold is used to filter out noise. This user-defined 
threshold can be from 1 to 20 % of maximum however, a preferred 
5 threshold is 10% of maximum. The differences in m/z values of neighboring 
triplets are used to search for cleaved amino acid residues or combinations 
of them by means of AminoCal, a shareware available on the World Wide 
Web (http://www.protana.com/software/default.asp). The use of the 
spreadsheet program and AminoCal facilitates the sequence assignment, 

10 but is in no way crucial to this invention. For peptides that have C-terminal 
lysine and arginine residues, such as tryptic peptides, their product-ion 
spectra also contain prominent [y„ + H + Ag] + ions that may be used to 
confirm residues cleaved from the C terminus. A m/z value of the [b n - H + 
Ag] + ion and that of a corresponding [y„ + H + Ag] + ion resulting from 

15 cleavage of the same peptide bond are linked by the following relationship: 
[y„ + H + Ag] + = [M + Ag] + + Ag+ - [b„ - H + Ag]+ . The observation of the 
corresponding [y„ + H + Ag]+ ions increases the confidence of the 

assignment for these peptides. 

Having generally described the invention, the same will be more 
20 readily understood through reference to the following examples which are 
provided by way of illustration, which are not intended to be limiting of the 
present invention. 

EXAMPLES 

^ENTERAL METHOD AND MATERIALS FO R THF EXAMPLES 
25 Mass Spectrometry. Experiments were conducted on two PE SCIEX triple 
quadrupole mass spectrometers, an API 365 and an API 3000 prototype 
(Concord, Ontario). All but one oligopeptide were commercially available 
(Sigma, St. Louis, MO). Samples were typically 2 /iM peptides in 50/50 
water /methanol containing 10 nM silver nitrate (Aldrich, St. Louis, MO). 
30 These were continuously infused by means of a syringe pump (Harvard 
Apparatus, Model 22, South Natick, MA) at a typical flow rate of 2 jiL/min 
into the pneumatically assisted electrospray probe with air being the 
nebulizing gas. The optimum probe position was established from time to 
time but was typically with the tip about 2 cm from the interface plate and 
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with the spray off-axis from the orifice. Mass spectra were acquired in the 
positive ion detection mode with unit mass resolution at a step size of 0.1 
m/z unit and at a dwell time of 10 ms/step. Typically, 10 scans were summed 
to produce a mass spectrum. Tandem mass spectrometry was performed 
with a nitrogen pressure of approximately 3 mTorr in q2 and at a collision 
energy at the center-of-mass frame (E cm ) typically of 1-3 eV. 

Bovine cytochrome c and ubiquitin were digested in the following 
manner (Matsudaira (1993)). A 1-mg quantity of protein was dissolved in 
100 ju.L of 8 M urea; 40 fiL of 0.8 M ammonium carbonate was added, 
followed by 22 iiL of 2 mg/mL trypsin. The solution was incubated at 37°C 
for 24 h. Water was added to make up the volume to 1 mL. Ten microliters 
of glacial acetic acid was added. The tryptic digest was subsampled; silver 
nitrate was added, to result in a final concentration of approximately 10 ixM. 
Two nL of the sample was loaded via a GelLoader tip (1-10 fiL r Eppendorf, 
Hamburg, Germany) into a nanospray probe (Wilm et al. (1994)) (Protana, 
Odense, Denmark) for electrospraying and sequencing. 
Sequencing. For sequencing, product-ion spectra of the [M + Ag] + ion were 
collected under £ cm = 1.5, 2.0, 2.5, and 3.0 eV. The values were summed to 
yield a composite spectrum. These spectra exhibited a wide range of 
abundant product ions. As an alternative approach, product-ion spectra 
were acquired with a linear m/z-dependent £ cm function to maximize the 
number of observable sequence-relevant product ions. For sequencing, the 
peak abundance file of the spectrum were treated to eliminate peaks below 
a certain user-defined threshold. These treated spectra were read into a 
custom search algorithm written in Visual Basic which looked for the triplet 
peak pattern of (m/z) v (m/z) 2 ) = {m/z\ - 18.0, and (m/z) 3 = (m/z) 2 - 28.0 as 
well as the doublet pattern of (m/z) 2 and (m/z)3, all to within ± 0.5 m/z unit. 
Triplet and doublet peak patterns that were found were imported into Excel. 
A user-defined threshold (typically 10% of maximum) was used to filter out 
noise. The differences in m/z values of neighboring triplets were used to 
search for cleaved amino acid residues or combinations of them by means 
of AminoCal. \ 

However, manual sequencing beyond this limit is possible in some 
favorable cases although the emphasis is the automated search 
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characteristics of the triplet pattern of product-ion peaks. 
EXAMPLE 1 

Figure\l shows that the product-ion spectra of the [M + 107 Ag] + ion 
* «A (silver has two\stable isotopes, 107 Ag and 109 Ag, of almost equal abundance; 
^ 5 the product-ioAspectra shown in this article will be those of either one of 
the two isotopes^ of leucine enkephalin, YGGFL, collected under different 
E cm s: (a) 1.5, (b) 21), and (c) 2.5 eV; (d) is the composite of (a), (b), and (c). It is 
well-known that Wdividual product-ion yield is strongly dependent on 
collision energy (Dawson et al. (Org. Mass Spectrom. 17, 205-211 (1982); 
10 Dawson et ala. (Or\ Mass Spectrom. 212-217 (1982)); a necessary step in 
sequencing is to acquire product-ion spectra under several collision energies 
to find the best spectraVfor sequencing. Summing the spectra to produce a 
composite provides a convenient way of presenting a minimal number of 
searchable mass spectra to the triplet/ doublet identification algorithm. An 
15 alternative method to generate a wide range of searchable product ions is to 
acquire a product-ion spectrum with m/z-dependent E cm function. Figure 2 
shows such a product-ion speWum also for the [M + 107 Ag] + ion of leucine 
enkephalin. It was acquired with a linear £ cm function from 2.5 eV for m/z = 
30 to 1.5 eV for m/z = 663, bu\ otherwise under the same experimental 
20 conditions as those in Figure 1 (th\ scan time equalled the total scan time of 
the three individual £ cm s). It is re\dily apparent that the spectra are not 
identical (due to differences in exa^ collision energies), but the overall 
spectral quality of the two is similar. 

Figures Id and 2 are of such quality that the amino acid sequence of 
25 leucine enkephalin can be read directly. The search algorithm looks for a 
triplet pattern of peaks, whose members are 18 and 28 m/z units apart, and 
assigns these as [b n + OH + Ag] + , [b„ - H + Ag] + , and [a„ - H + Ag] + , in 
decreasing m/z values (the product ions are labeled as b*, b, and a, 
respectively, in the figures). The amino acid residues that are cleaved from 
30 the C terminus can be determined from the differences in the m/z values of 
the [M + Ag] + ion and the appropriate triplet members. For example, in 
Figure 2 the first triplet that may be identified starting from m/z 662.2 is m/z 
548.9, 531.2, and 503.1 (differences of m/z values, Ara/z = 17.7 and 28.1). The 
peak with the largest m/z value in the triplet is the [b„ + OH + Ag]+ ion, 
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which is a truncated peptide after cleavage of the C-terminal residue (Chu et 
al. (not published)). The difference in m/z values between the [M + Ag] + ion 
and the [b„ + OH + Ag] + ion is 662.2 - 548.9 ) 113.3, thus identifying leucine 
or isoleucine (theoretical Am/z = 113.1) as the C-terminal residue that is 
5 cleaved. The second triplet that is identifiable is m/z 402.1, 384.0, and 356.1 
(Am/z = 18.1 and 28.3). The difference in m/z values between the [b n + OH + 
Ag] + ion of the first triplet and the [b n + OH + Ag] + ion of this second triplet 
(i.e., the [b M -l + OH + Ag] + ion) is 548.9 - 402.1 = 146.8, which identifies the 
residue preceding the C terminus as phenylalanine (theoretical Am/z = 
10 147.1). The third triplet identified is m/z 345.0, 327.0, and 299.1 (Am/z = 18.0 
and 27.9). The difference between the m/z value of the [b„_! + OH + Ag] + ion 
of the preceding triplet and that of the [b„_ 2 + OH + Ag]+ ion of this triplet is 
402.1 - 345.0 = 57.1, which identifies the third residue from the C terminus as 
glycine (theoretical Am/z = 57.0). The triplet pattern disappears on further 
1 5 decreasing m/z, and the search algorithm reaches its application limit. 

The last triplet peak identified has a m/z value of about 300. This 
signifies it is likely the first two amino acid residues from the N terminus for 
m/z < 300, which means that the third peak of the last triplet, m/z = 299.1, is 
the [a 2 - H + Ag] + ion. Cleavage of the peptide linkage between the 
20 N-terminal and the second residues produces the n = 1 series of product 
ions; however, the [b 1 - H + Ag]+ ion does not typically exist as most b ions 
are oxazolones, which are composed of a minimum of two residues, and the 
[bj + OH + Ag]+ ion is almost always absent (Yalcin et al. (1995); Yalcin et al. 
(1996); Ambihapathy et al. (1997); Nold et ala. (1997); Lee et al. (J. Am. Chem. 
25 Soc. 1998). The only abundant product ion typically seen from the cleavage 
between the N-terminal and the second residues is the [a x - H + Ag] + ion, 
which is assigned to the peak at m/z = 242.1. The difference in m/z values 
between the [a 2 - H + Ag] + and the [a t - H + Ag]+ ions is 299.1 - 242.1 = 57.0, 
which identifies the second residue (or the fourth residue from the C 
30 terminus) as, again, glycine. The identity of the N-terminal residue is found 
by subtracting the difference between the m/z values of the silver isotope 
and the proton, 106.9 - 1.0 = 105.9 in this example, from the m/z value of the 
[a 1 -H + Ag] + ion, 242.1 - 105.9 = 136.2, which identifies it as the immonium 
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ion of tyrosine (theoretical Am/z = 136.2). Thus, the determined amino acid 
sequence of leucine enkephalin is Y-G-G-F-L/I, which is correct. 
EXAMPLE 2 

jure 3 shows a product-ion spectrum of the [M + 107 Ag] + ion of 
GGEG^b, glycylglycylglutamylglycylglycine. Sequencing of the argentinated 
peptide\is straightforward and similar to that discussed in Example 1 for 
leucine enkephalin. The difference in m/z values between the [M + Ag]+ ion 
and the first [b n + OH + Ag] + ion is 482.1 - 425.1 = 57.0, which identifies the 
C-termina\residue as glycine (theoretical Am/z = 57.0). The difference in m/z 
10 values betAeen the first [b „ + OH + Ag] + and the second [b „ + OH + Ag] + is 
425.1-368.0 \ 57.1; this identifies the residue preceding the C-terminus as, 
again, glycink Repeating the procedure yields the next residue, glutamic 
acid (experimental Am/z = 128.9 versus theoretical Am/z = 129.0). The triplet 
peak pattern\ disappears beyond glutamic acid; however, manual 
15 interpretation ieveals further sequence information. The [a x - H + Ag] + in 
this product-ion\spectrum is weak. However, the small m/z value of the [a„ 
- H + Ag]+ ion oAthe last triplet, 193.1, strongly suggests that it is the [a 2 - H 
+ Ag] + ion. This rnakes G-G as the only possible option for the N-terminal 
and second residues (theoretical m/z = 193.0). Furthermore, Figure 3 shows a 
20 small peak at m/z\ = 136.0, which is assignable as the [a 2 - H + Ag] + ion 
(theoretical m/z = 136.0, assuming the first residue is glycine). Thus, the 
determined sequence is G-G-E-G-G. 
EXAMPLE 3 

The following\example relates to a longer peptide bradykinin. A 
25 product-ion spectrum bf the [M + i09Ag] + ion of bradykinin, RPPGFSPFR, is 
shown in Figure 4. Sequencing of the [M + H] + ion of bradykinin is 
considered difficult bWause the external proton is believed to be 
sequestered by the highl\basic guanidine groups on the side chains of the 
two arginine residues, thus rendering it unavailable for binding to the 
30 amidic functional groups Vnd inducing charge-proximal fragmentation 
along the peptide backbone (Alexander et al. (1990); Tang et al. (1993); Burlet 
et al. (1992); Cox et al. (1996)VSummerfield et al. (1997)). The Ag + ion, 
however, appears to bind to maW different sites on the peptide, as evident 
from the relative richness of the fragmentation pattern in Figure 4. Table 1 
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summarizesVhe triplets found and the residues identified using the above 
approach. It\s apparent that only a partial sequence of five residues starting 
from the C-tefcrrinal end of the peptide has been solved - FSPFR; the major 
advantage oAsequencing argentinated peptides relative to protonated 
peptides is thi triplet relationship which greatly facilitates product-ion 
assignment. Rradykinin has the highly basic arginine as its C-terminal 
residue, which Also binds strongly to the silver ion (Lee at al. /. Am. Soc. Mass 
Spectrom. (199$)). In fact, it is our observation that peptides that have 
C-terminal methionine, lysine, and arginine residues tend to yield relatively 
strong [y n + H + Ag] + product ions (Li et al. (1997); Chu et al. (not 
published)). In the search algorithm, presence of the corresponding [y„ + H 
+ Ag] + ion is usld as confirmation of the cleaved residue. For bradykinin, 
the [y n + H + Agr ions for n = 3-5 have been observed (Figure 4 and Table 
1), which confirm! results of the triplet search. In Figure 4, the triplet signal 
corresponding to lleavage of the proline residue is weak; this is actually a 
confirmation of thi proline-residue assignment. Proline is the only residue 
from which an oxalolone [b„ - H + Ag] + cannot be formed and of which the 
relatively weak [b M -tti + Ag] + ion is believed to be a ketene (Lee et al., J. Am. 
Chem. Soc. (1998)). The assignment of proline can often be confirmed by the 
presence of the appropriate [y n + H + Ag]+ ions or the identification of the 
next triplet; the difference in the m/z values of the [b„ + OH + Ag] + ions for n 
= 1 and n = 3 is 1010.3 -^66.5 = 243.8, which can only mean a combination of 
cleavage of phenylalanine and proline. 
EXAMPLE 4 

Figure 5 shows a pVoduct-ion spectrum of the [M + !07Ag]+ ion of 
dynorphin A fragment l-A YGGFLRR, another highly basic peptide. The 
triplets and the residues identified using the search algorithm are tabulated 
in Table 2. Again, the partial sequence determined, - -FLRR, is confirmed by 
the presence of the appropriate^y n + H + Ag] + ions. 
EXAMPLE 5 

A frequent and valid criticism of the above type of examples is that it 
is relatively easy to sequence a peptide whose identity is known a priori. One 
of us (IKC) was presented with tryptic digests of proteins unknown to him. 
He was asked to sequence a number of the tryptic peptides from 
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approximately 5 fih of digests using nanospray ionization. Table 3 shows 
the sequences of the tryptic peptides determined using the search algorithm 
of the present invention and the sequences subsequently revealed to the 
experimenter; in all cases, the partial sequences determined were correct. 
5 Figure 6 shows two of the product-ion spectra obtained to illustrate the 
quality of the sequence data; the presence of the three triplet series of peaks 
plus the [y„ + H + Ag]+ series facilitate sequencing and increase the 

confidence in assignment. 
DTSniSSTON OF FXAMPLES 

10 The greatest potential for application of sequencing argentinated 

oligopeptides that we envisage is in the area of automated sequence tag 
analysis in proteomics, where accurate, automatic peak, and subsequently 
residue, assignment can be made because of the presence of four series of 
peaks, the [b„ + OH + Ag]+ , [b„ - H + Ag]+ , [a„ - H + Agl+ , and [y M + H + 

15 Ag]+ ions, and identification of only three or four residues are needed for 
often unambiguous searches (Shevchenko et al. (1996); Figeys et al. (1996)). 
Indeed, inputting the partial sequences shown in Figure 6, together with the 
bracketing m/z values of the b ions for the corresponding protonated 
product ions (for Figure 6a, they are 335.2 + 1 - 108.9 = 227.3 and 610.1 + 1 - 

20 108.9 = 502.2; thus the sequence tag search parameters are 
227.3/F/A/G/502.2 for this tryptic peptide) plus the peptide mass (the m/z 
value of the precursor ion minus 108.9) and the molecular mass of the 
protein (the m/z value of the [M + i09Ag] + ion of the protein prior to trypsin 
digestion minus 108.9, which was measured in a separate experiment and 

25 provided to the analyst) unambiguously identifies the protein as bovine 
ubiquitin after a search using Peptide Scan (PE SCIEX) with sequence index 
files (database) downloaded from the European Molecular Biology 
Laboratory (ftp : / / f tp .embl-heidelber g. de /pub / databases / nr db / ) . The 
results are summarized in Table 4. All the proteins are ubiquitin; 

30 swiss/p02248, bovine ubiquitin, has the best match with the measured 
protein molecular mass (Mr) of 8565 Da and has the correct sequence as 
subsequently revealed. 

While the present invention has been described with reference to 
what are presently considered to be the preferred examples, it is to be 
35 understood that the invention is not limited to the disclosed examples. To 




- 16- 

the contrary, the invention is intended to cover various modifications and 
equivalent arrangements included within the spirit and scope of the 
appended claims. 

All publications, patents and patent applications are herein 
5 incorporated by reference in their entirety to the same extent as if each 
individual publication, patent or patent application was specifically and 
individually indicated to be incorporated by reference in its entirety. 
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Table 1 



Sequencing Results for Bradykinin 

[b„+OH+Ag]+ [b„-H+Ag]+ [a„-H+Ag]+ residue residue confirmed 
triplet (m/z) (m/z) (m/z) cleave assigned [y„+H + Ag]+ 



1168.4a 


NA b 


NA 


156.1 


R 


1012.3 


994.3 


966.3 


147.0c 


F 


865.0 


847.1 


819.9d 


97.2 


P 


768.1 


750.2 


722.2 


87.0 


S 


681.2 


663.1 


635.2 


147.0 


F 


534.2 


516.2 


488.2 







5272. 
614.2 
761.3 



a [M + i09Ag]+ b Not applicable, c [(1012.3-865.0) + (994.3 - 847.1) + (966.3 - 
819.9)] /3. d Weak [b„ - H + Ag]+ and [a„ - H + Ag]+ ions due to proline residue (see 
text for details). 
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Table 2 

Sequencing Results for Dynorphin A Fragment l-7a 

triplet [ b „ [b„-H+Ag]+ [ a - „ residue residue confirmed 

+OH+Ag]+ (m/z) H+Ag] + cleaved assigned [y„+H+Ag] + 
(m/z) (m/z) 

0 974 5 NA NA 156.0 R 281.0 

1 818.5 800.0 772.5 156.3 R 437.0 

2 661.5 644.0 616.5 113.0 L/I 550.0 

3 549.0 531.0 503.0 147.0 F 

4 402.0 384.0 356.0 



a See Table 1 for explanation of details. 
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fable 3 



Sequencing o\ Tryptic Peptides 



peptide 

1 
2 
3 
4 
5 



determined sequence 

-FAGK 
-(L/I)FVI 
-TGK 

-DVEK 
-V(Q/K)Ka 



actual sequence 

LIFAGK 
MQIFVK 
TLTGK 
GDVEK 
IFVQK 



a K and Q are isobaric; however, since trVpsin cleaves only on the C- 
terminal side of K and R, the identity olthe C-terminal residue is 
unambiguous, but that of preceding residues is not. 



• 
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Table 4 



Sequence Tag Analyses of Tryptic Peptides 
(Protein Molecule Mass: 8565 Da) 



(a) 



Peptide mass: 647.2 Da (756.1- 
108.9) 

Sequence tag: 227.3/FAG/502.2 

(335.2 + 1 - 108.9 = 227.3; 

610.1 + 1 - 108.9 = 502.2) 

Peptide mass: 764.4 Da (873.3 - 



Protein 



Mr 



swiss | P02248 
spit|Q17001 
spit | Q41405 
1 1 M21581 



8564.87 



8563.89 



8562.83 



8566.91 



1 1 M22218 



8566.91 



108.9) 

Sequence tag: 260.1 /L (or I) 
FV/619.1 

(368.0 + 1 - 108.9 = 260.1; 
727.0 + 1 - 108.9 = 619.1) 

Swiss | P02248, bold sections correspond to the tryptic peptides entered 
for the search and identification 



MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLE 
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